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ESTIMATING AVERAGES OF ORDER STATISTICS OF 
BIVARIATE FUNCTIONS 

RICHARD LECHNER, MARKUS PASSENBRUNNER, AND JOSCHA PROCHNO 


Abstract. We prove uniform estimates for the expected value of averages 
of order statistics of bivariate functions in terms of their largest values by a 
direct analysis. As an application, uniform estimates for the expected value 
of averages of order statistics of sequences of independent random variables in 
terms of Orlicz norms are obtained. In the case where the bivariate functions 
are matrices, we provide a “minimal” probability space which allows us to C- 
embed certain Orlicz spaces t^ into , c, C > 0 being absolute constants. 
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1. Introduction and main results 

In the series of papers m dig nan], sequences of random variables and their 
order statistics were studied in several different settings and the obtained results 
were applied successfully to the local theory of convex bodies. In [ 7 ], the authors 
studied expressions of the form 

i 

E V'' k-max |a;,:Ab|, 1 < t < n, (1.1) 

' l<z<n 
k =1- 

with independent identically distributed (iid) random variables Xi, i = 1,..., n and 
real numbers Xi, i = 1 Here, k-maxi <j<„Xj(o;) is the fc-th order statistic 

of a statistical sample of size n, which is equal to its &-th largest value. Besides 
being fundamental tools in statistics with various applications in, e.g., compressed 
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sensing mm, wireless networks m, and data streams [32], order statistics of 
random samples appear naturally in Banach space theory, e.g., in computations 
of the distribution of eigenvalues of random matrices pi mi, and in calculating 
sharp bounds for the expectation of the supremum of Gaussian processes indexed 
by certain interpolated bodies [6] . For general information on order statistics, we 
refer the reader to [5] . 

Especially we would like to point out that the results that were obtained in [7], 
in particular the estimates for (1.1), were crucial to obtain estimates for various 
parameters associated to the local theory of convex bodies [8], e.g., type and cotype 
constants, p-summing norms, volume ratios, and projection constants. 

An integral tool in m and thus in [amain] are combinatorial estimates going 
back to S. Kwapien and C. Schiitt mm- Those estimates relate an average (over 
the group of permutations) of the largest order statistic of a matrix a to the average 
of its largest entries. To be more precise, it was shown that for all a £ R nx " 


1 

n! 


E 

7rG©n. 


max 

Ki<n 


r{i) 


- -E s ( fc )> 

n L ' 


k =1 


where s(k) is the A'-tli largest entry of the matrix consisting of the absolute values 
of a and 6 n is the symmetric group on {1,... ,n}. In TB], this was established in 
the following setting: under some assumptions on the normalized counting measure 
P on a collection G of maps from I = {1,..., n} to J = {1,..., N}, we have that 
for every matrix a £ W nxN and every £ < n, 

m 

( 1 - 2 ) 


M 


N ^ 

3 =1 


SU) < / 

Jg 


^ C 

k-max|a ig(i) |dP( 5 ) < — 

1 <i<n /v 


k =1 


N 

3 = 1 


where c and C are positive constants only depending on G. Special choices for G 
so that (1.2) holds, include the symmetric group & n and {1,...,Those 
estimates were then used to deduce similar combinatorial estimates for £ p norms. 

In this work we extend our results from m and study averages of order statistics 
of bivariate functions a : {1,..., n} x Q —> R, where (r2,J^,/j,) is an arbitrary 
probability space. In this setting, G will be a collection of maps from I = {1,... ,n} 
to the probability space (f 1,^,/j). We denote the decreasing rearrangement of a 
by a*. Our main result is the following: 

Theorem 1.1. Let n £ N, G be a collection of maps from I = {1,... ,n} to the 
probability space ($7,/i), Cq > 1 be a constant only depending on G, and P be a 
probability measure on G. Assume that for all i £ I, all different indices ii ,*2 £ I 
and all measurable sets A, A±, A 2 C LI, 

(i) F(g(i) £ A) = //(A), 

(ii) P(g(*i) S A 1 ,g(i 2 ) £ A 2 ) < C' G /r(Ai)/i(A 2 ). 

Then, for every measurable function a : I x LI —► K. and for every £ < n, 


■ / a*(t)dt,< / yk-max|«(i, 5 (i))|dP( 5 )<C - / a*(t)dt, (1.3) 
Jo Jg y; !<*<« Jo 


where 1/c = 48(1 + 2Cg) 2 and C = 6(1 + 2Cg). 


As a direct consequence, when the bivariate functions are matrices and P is 
the normalized counting measure on G, we obtain one of the main results in m 
Theorem 1.1]: 
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Corollary 1.2. Let n,N £ N and a £ R nxN . Let G be a collection of maps from 
I = {1,... ,n} to J = {1,..., N} and C G > 0 be a constant only depending on G. 
Assume that for all i £ I, j £ J and all different pairs (ii,ji), (* 2 , 12 ) £ I x J 
( 1 ) P({g£G:g(i)=j}) = l/N, 

(ii) P ({g £ G : g(i 1 ) = ji,g(i 2 ) = jz}) < C G /N 2 . 


Then, for every t < n, 


c 

N 



m 


(1.4) 


where l/c= 48(1 + 2 C G ) 2 and C = 6(1 + 2Cg). 


In this work we also present an example of a set of maps, say Go, with a minimal 
number of elements satisfying conditions ([ 1 ]) and ([ii]) in Corollary 1.2 thus guaran¬ 
teeing that the inequalities in (1.4) hold. When N = n, the cardinality of Go is n 2 . 
The set of maps provided here is based on finite fields of n elements, where n is a 
power of a prime nu mber . It is not hard to see that if G and P satisfy conditions ||i| 
and 


(|iij) of Corollary 
elements. 


1.2 


for some constant C G > 1, then G consists of at least 


Cg 


We then apply this result to obtain the following: 


Theorem 1.3. There exist constants c, G > 0 such that for all n £ N and every 
strictly convex, twice differentiable Orlicz function M : [0, 00 ) —>• [0, 00 ) that is 

C 3 

strictly 2-concave and satisfies M*{ 1) = 1, we have that c —>• if 1 ■ 


We also provide an application of Theorem o to sequences of iid random vari¬ 
ables to obtain estimates for (1.1). Those estimates are in terms of Orlicz norms 
and recover Corollaries 2 and 3 of j?J. To be more precise, we prove the following: 


Theorem 1.4. Let X\, ..., X n be a sequence of iid random variables with E|Xi| < 
00 . Let 1 < i < n and Ad be the N-function given by 

M*(J* X*(z)dz) = 0 < /? < 1. (1.5) 

Then, for all x £ M", 


l 

c\\x\\m < E k-max \xiXi\ < C\\x\\ M , 

AJ l<i<n 
k =1- 


where c, G > 0 are absolute constants. 


Let M* be given as in ( |1.5[ ). Then, for all s > 0, 

|X|dPd t. 


M(s) = f j 
Jo J 1 : 


0 J\x\>i/{tt) 


For i = 1, this was shown in m pp. 4-5]. A simple calculation shows that it holds 
for general £ as well. Therefore, we indeed obtain Corollaries 2 and 3 of [7]. 

While in [7J the proof involves estimates for the largest order statistic of a matrix 
and makes use of combinatorial results of mm in a crucial way, our approach is 
based on a purely probabilistic and direct analysis of <o> (Theorem , and is 
interesting in its own right. 
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The organization of the paper is as follows. Section [5] serves the purpose of 
introducing notations and preliminary results that we use throughout the paper, 
where the measure theoretic ones are especially important for the proof of the main 
theorem. Section [3] contains the proof of Theorem |1.1| This is done by reducing the 
problem to the case of functions only taking values in {0,1} and showing the result 
for this subclass of functions. Section [4] contains the application of Theorem |1.1| to 
sequences of iid random variables and thus the proof of Theorem |1.4| In Secti on [5| 
we present the minimal set of maps Go needed to guarantee the inequalities in (1.4) 


and give a proof of the embedding result Theorem 1.3 


2 . Notation and preliminaries 

Given a random variable X : Q —> K. on a measure space (f2, we define 

its distribution function F x (t) := /i({w G SI : |A'(w)| > t}), t > 0. The decreasing 
rearrangement of X is then defined for allt > 0 by 

X*(t) = inf{s > 0 : F x (s ) < t}, 

where we use the convention that inf0 = oo. Note that if F x is continuous and 
strictly decreasing, then X* is simply the inverse of F x . Moreover, notice that X 
and X* are equimeasurable, i.e., 

H(X G A) = A(A* G A) 

for all measurable subsets 4cl, where A denotes the Lebesgue measure. By 
we denote the characteristic function of a set A. 

A convex function M : [0, oo) -A [0, oo) is called an Orlicz function, if M{ 0) = 0 
and if M is not constant. An Orlicz function (as we define it) is bijective and 
continuous on [0, oo). Given an Orlicz function M. we define its conjugate function 
M* via the Legendre transform, 

M*(x)= sup (xt— 

[0,oo) 

We have M = M**, since the Legendre transform is an involution. Note that M* 
is again an Orlicz function if M is an Wfunction, i.e., if additionally 

lim M{x)/x = 0 and lim M(x)/x = oo. 

x—tO x—>oo 

Given an Orlicz function M and a measure space (T, ip) , the Orlicz space l m{ r) 
is the space of all (equivalence classes of) measurable, real valued functions / on T 
such that 

J M(\f\/\) dtp < oo, 

for some A > 0. We ecpip Lm( T) with the Luxemburg norm 

H/IIm = inf |A > 0 : j M(|/|/A) diP < l|. 

The closed unit ball of the space Lm will be denoted by Bm■ Note also that if M 
is an Wfunction, we have 


||/||m < sup f ■ g dV’ < 2 ||/||m- 
seb m * Jr 

For a detailed and thorough introduction to Orlicz spaces, cf. eg. 
Another result we will use is Paley-Zygmund’s inequality: 


(2.1) 


or 
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Theorem 2.1 (Paley-Zygmund). For every non-negative random variable Z and 
every number 0 < 6 < 1, we have 

P(Z>e-EZ)>(l-9 ) 2 ^§f. 

Moreover, we will need the following measure theoretic results: 

Theorem 2.2 (Sierpihski). Let [R.Sft.p) be a non-atomic measure space with 
p(R) = c. Then there exists a function f : [0, c] —>• Stf satisfying 

(i) f(t ) C f(s) for 0 < t < s < c, 

(ii) p{f(s)) = s for 0 < s < c. 

Sierpinski’s theorem allows us to construct to a given measurable function a new 
one that is constant only on sets of measure zero and which has the same ordering. 

Proposition 2.3. Let (R,3£,p) be a finite measure space. For every measurable 
function a : R —> [0, oo) there exists a measurable function b : R —> [0, oo) with the 
following properties: 

(i) for all x £ [0, oo) we either have p(b = x) = 0 or {b = x} is an atom. 

(ii) for all s,t £ R, we have a(s) > aft) implies that b(s) > b(t). 

Proof. Before we begin with the construction of the function b satisfying properties 
([if and (Jn|, we sketch its idea. 



Figure 1. Construction of b. 

First, we consider the sets {a = tj} for those tj such that p(a = tj) > 0. We 
decompose each of those sets into atoms Aj t k of 3% and a continuous part Bj. We 
define our function b on {a = tj} in such way that it takes different values on each 
of the atoms Aj^. On the continuous part Bj. it is defined in such a way that 
p({b = t} 0 Bj) = 0 for all t, where we use Sierpinski’s theorem. 

Let (tj)j(^.yv be the decreasing sequence of all numbers t such that 

p(a = t)> 0. 

Note that there are at most countably many t's with this property, i.e., we can 
choose JZ = A^} for some N £ N or jV = N. Additionally, we set to = oo 
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and define the sets 


Cj = {tj < a < tj-i}, j £ JT and D = {a < inf tj}. 

jejy 

First, we specify the function b on the sets Cj, j £ <jY and D by 

b(s) = a(s) + dj, s £ Cj and b(s) = a(s), s £ D, (2.2) 

where dj = J} i>j 2~ l . Note that we have 

ie-A' 

{ a = tj}= |J A. jM U B :j , j£^C, 

j 


where {Aj^ k } are atoms, Bj is non-atomic and „#j is either {1,..., Mj} or N. The 
offset dj introduced in (2.21 allows us now to define b on {a = tj} such that it takes 
different values on each of the atoms A ]k and such that p({b = t} HBj) =0 for all 
t. Second, we define 


b{s) — a(s) + dj -|-i + 


dj dj + 1 




s £ A 


j,ki 


on each of the atoms A ] k . In order to define b on the remainder Bj of the set 
{a = tj} we the invoke Sierpinski’s Theorem to obtain an increasing function fj : 
[0 ,p(Bj)] —► & such that p(fj(y )) = y for all y £ [0 ,p(Bj)\ and define 


b{s) = a(s) + d j+ 1 + dj ^ J+1 + • inf {a : f 3 {a) 9 s}, s G Bj. 

Since b satisfies condition ([h]) by construction, it is left to show that it satisfies |i]) 
as well. To this end, let x £ [0, oo). Note that 


{b = x} = {a = x — dj}, whenever {6 = cc} D Cj ^ 0. 


Since tj < x — dj < tj-i, we know p({b = x}) = 0. Furthermore, observe that if 
{b = x} fl D ^ 0, then p(b = x) = p(a = x) = 0. Second, note that if there exist 
indices j £ JV and k £ J(j such that {b = x} n Aj tk ^ 0, then {b = x} = Aj tk by 
construction. Since A, j k is an atom, so is {b = x}. Third, assume there exists an 
index j £ JA such that {b = x} fl Bj ^ 0. Note that by construction {b = a;} C Bj, 
thus there exists some number x' such that 


{b = x} = {z £ Bj : inf{a : fj{a) 9 z} = x'}. 

Hence, for all e > 0 we have 

{b = x} C fj(x' +e)\ fj(x' - e), 

and, by the properties of fj, we conclude p(b = x) = 0. □ 


Lemma 2.4. Let ([0 ,a),3§,n) be a finite, signed measure space satisfying 

k[0,*)>0, 0 < t < a. (2.3) 

Let f £ L i(k) be non-negative and decreasing. Then 



f(t)dn{t) > 0. 
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Proof. First, observe that it is enough to show the assertion of the lemma for simple 
functions / of the form 

m 

f = Y^ 1 b 3' 

i=i 

where fj > 0 is decreasing and {Bj} is a partition of [0, a) of measurable sets such 
that sup Bj = inf Bj + 1 , 1 < j < m— 1. Note that these conditions imply that Bj is 
a connected subset of [0, a). We define gj = K(Ui=i B i), 1 < j < m and g 0 = 0 and 
observe that (2.3) implies gj > 0. This is true since Ui=i is either [0, supBj) 
or [0,supBj] and «[0,t] = lim„ k[0 ,t + 1/n) > 0 for all 0 < t < a. Using partial 
summation we see 

,, mm 

/ fit) dn(t) = Y fj K i B j) = Y foidj - 9j- 1 ) 

• 7 ) 0 , a) 


3 = 1 


3 =1 


— fm9m ^ ] (/j fj—l)9j—l — 0) 
i=i 

since / > 0 is decreasing and g, > 0 as noted before. 


□ 


3. Proof of the main theorem 

The purpose of this section is to prove Theorem o We start with some neces¬ 
sary definitions and lemmata. 

3.1. Preparatory definitions and results. We define the measure space 

(S, S, a) := ({1,..., n} x ft, ^({1,..., n}) 6 <g> /i), 

where is the probability space from Theorem 5 is the counting mea¬ 

sure on {1,..., n} and ^({1,..., n}) denotes the power set of {1,..., n}. Observe 
that <7(S') = n and a measurable subset A C S is an atom in S if and only if 

A = {*} x A! up to a cr-null set for some 1 < i < n and some atom A! in f 1. 

Let a : S —> R be a measurable function with respect to Lebesgue measure on R, 
which will be fixed throughout the entire section. Note that, without restriction, 
we assume that a is non-negative. We apply Proposition |2.3| to the function a and 
obtain a measurable function b : S -A [0, oo) with the following properties: 

(i) for all x £ [0, oo) we either have a(b = x) = 0 or {b = i} is an atom, 

(ii) for all s,t £ S, we have a(s) > a(f) implies that b(s) > b(t). 

We define the set of all measurable functions on S that are ordered in the same 
way as b by 

stb := {d : S —>-[0, oo) measurable | 

b{x) < b(y) => d{x) < d(y) for all x,y £ 5}. ^ ^ 

This means that, if b is constant on some set B , then any d £ is constant on B 

as well. Note that in general, d £ sAb may be constant on some set Bq, where b is 
not. 

Moreover, we define the function h : [0, oo) —> S by 

OO 

Kt) == [j{b > b*(t-l/j)}. 

3=1 
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Roughly speaking, h(t) is the subset of S having approximately measure t, on 
which b takes its largest values. We single out those parameter values t such that 
a(h(t)) = t by setting 

U := {t £ [0, n] : a(h(t)) = t}. 

Since U plays an important role in what follows, we first investigate some of its 
properties. 

Lemma 3.1. The set U has the following properties: 

(i) n £ U, 

(ii) For allt £ [0,n] we have thatt £ U c if and only if there exists an open interval 
V 9 t such that b* is constant on V. 

(Hi) U is closed. 

(iv) If (c,d) C U c , then b* is constant on ( c,d ). 

Proof. ([I]) By definition of h, and since b and b* are equimeasurable, we have for all 
positive integers j: 

v[h(n)) > a(b > b*(n - l/j)) = A (b* > b*(n - l/j)) >n - 1/j, 

i.e., cr(h(rt)) > n. On the other hand, a(h{n)) < cr{S) = n. 

<§ Let t £ U c . Thus, there exists an index j satisfying 

a(h(t)) > a(b > b*(t — l/j)) > t. 

This implies that A (b* > b*(t — l/j)) > t and so there exist two points to,ti with 
to < t < ti such that b*(t 0 ) = b*(ti). But since b* is decreasing, b* is constant on 
(to, ti). On the other hand, let b* be constant on the interval (f —2 e,t + 2s). This 
implies that h{t) = h(t + e) and so, 

cr(h(t)) = a(h(t + e)) > t + £ > t, 

i.e., t £ U c . 

(iii I This is an immediate consequence of (Juj). 

(iv) Finally, let c < d with (c, d) C U c and I C (c, d) be a compact interval. For 
every t £ / we use |il]) to choose an open interval V (t) containing t on which b* is 
constant. By compactness, b* is constant on J, and since / was arbitrary, ( frv| ) is 
proved. □ 

We now recall the assumptions of Theorem o The collection G is a subset of 
all mappings from {1,..., n} —► ft and P is a probability measure on G satisfying 

(i) P (g{i) £ A) = fj,(A), 

(ii) P(g{ii) £ A 1: g(i 2 ) £ A 2 ) < CGp(A 1 )fi(A 2 ) 

for all i £ /, all different indices ii,i 2 £ /, and all measurable sets A,Ai,A 2 C f 1. 
Next, for all t £ [0,n], we define the random variable 

Xt : G —> {0,... ,n}, g \g n h(t)\, 

where | • | denotes the cardinality and we interpret g as the graph of g , i.e. gC\h{t) = 
{(«, <?(*)) £ h(t) : i £ I}. Observe that the definition of h and property ([iv]) of the 
above lemma imply that for t £ [0, n] : 

X t = X Uo , where uq = infjrt £ U : u> t}. 

We will now study some properties of these random variables that are essential for 
the proof of Theorem o 
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Proposition 3.2. The random variables (X t ) have the following properties: 

EX t = <j(h(t)), tg [0,n], 

EX? < t(l + C G t), teu. 

Proof. Let t G [0,n]. We have X t (g) = J27=i Y i(9), with Yf(g) := \{(i,g{i))} n 
h{t )| G {0,1} and, since h(t) is a measurable subset of S , we can write 

n 

h (t) = UW X A i 

with some measurable sets A\,, A C f2. Therefore, by assumption (|i| in Theo¬ 
rem [TTTJ 

n n n 

EW = ^El? = £P( ff (0 G Al) = 5>(4) = a(h(t)). 

i—1 i—1 i—1 

Now we assume t G U and estimate EX? using assumption ([ii| of Theorem 

n 

ex? = J2 EY i + EY i ■ y j 

*=1 

<t + ^P(g(i) € A$,g(j) € At) 

n 2 

< t + g(Al)j =t(l + Cat), 

i =1 

where we used that, by definition, a(h(t)) = t for any t G Z7. □ 


t 


Proposition 3.3. For all t G [0, n], 

P(X t > i/2) > 

V * “ ' ’ ~ 4 + 4 C G t 

Proof. First, we assume that t G U. Then, Paley-Zygmund’s inequality (Theo¬ 
rem 2.11 in combination with Proposition 3.2 and the choice 6 = 1/2 imply the 


desired inequality. If t G U c , define uq '■= inf {it G U : u > t}. Hence, property ( |iv| ) 
of Lemma |3.1| implies X t = X Uo . Therefore, 

u 0 


P(X t > i/2) = P(X Uo > i/2) > P(X U0 > uq/2) > 
Since uq > t and the function s 


4 + 4Cguo 

s/ (4 + 4 Cqs) is increasing, the result follows. □ 


Corollary 3.4. Fort G [0,n], we have 

P(X,>l)>mi„P ‘ }. 


Proof. If i < I/Cq, we obtain from Proposition |3.3[ the fact that X t takes only 
integer values, and because C G > 1 


P(X t > 1) = P(X t > i/2) > 


t ^ t 
4 + 4 C G t ~ 8' 


(3.2) 


If t > 1 /Cg, we get from Proposition 3.3 and since X t takes only integer values 


P(Xt > 1 ) > P(X 1/Cg > 1 ) = v(x 1/Cg > 2 ^) 


> 


8C g 


(3.3) 
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Combining (3.2) and (3.31 concludes the proof. 


□ 


As a matter of fact, we will use this corollary in the form 

t 1 


P(A' t > 1) > minj-, 


8C g 


> 1 ), 


1 < i < n. 


Corollary 3.5. Let k £ N with 1 < k < n. Then, for t £ [2k, n], 

rfx > s a 2uW 

Proof. This is a direct consequence of Proposition |3.3[ 
Corollary 3.6. For all k £ N in the range 1 < k < 1/2, 


□ 


Proof. Let k be an integer in the range 1 < k < 1/2. Using Proposition 3.3 with 
t = 2k, we obtain 


Ek-maxlft/n > E( k-max X 2k > k') 

l<i<n v J v l<i<n y ' 


=nx 2k >k)> 


2k 


> 


i 


4 + 8 C G k ~ 2 + 4C G 


□ 


3.2. Reduction to Boolean functions. In this section, we e stimate the expres 


sion f„ l k-max \a(i,g(i))\ dP(g) occurring in Theorem 

^ l<i<n 


l.i 


for general matrices 


a by the same expression with a replaced by some averaged matrix a. In order to 
begin our investigation, we first have to give a few definitions. For a measurable 
function / £ we set 


M. 


f*{s)ds-l hW and f t := t £ [0, n\. 


(3.4) 


Observe that both / £ and ft £ M,- Moreover, we write 

l 

Sk{f)(g) ■■= k-max f(i,g{i)) and S(f){g) := ^ S k (f)(g) 

1 < i < n ' ■* 


k =1 


for all / £ £/b and g £ G. Then, for any k £ N with 1 < k < l, 

ES k (a t ) = P (Sk(at) = 1) = P(X t >k), t £ [0,n] 
and, using also the equation in Proposition |3.2[ 


ES k (St) = mm{E „ XtJ} ¥(X e >k), t £ [0,n], 


(3.5) 


(3.6) 


where at = (aj)~, i.e. we first apply the operation - t and then the operation ~ 

We first establish our result for the special functions a t in Proposition |3.8[ which 
will then allow us to prove (cf. Subsection 3.3) the same result for general functions 
a in Theorem 13.91 


Proposition 3.7. For all t £ [0, n] we have 

ES(a t ) < (6 + 12C g ) • ES(at). 
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Proof. First, assume l=\. Then, equation (3.6) and Corollary 3.4 yield 
ES'(dt) > min{EA' t , l}P(Xi > 1) 

> min{EA t , 1} > -^-¥(X t > 1) = -^-ES(a t ), 
o U(? o LsG oOg 


where we used (|3.5|) in the latter equality. Second, assume that £ > 2. Due to 

t/i 


equation (3.6) we have 

min{EX t , £} 


E S(a t ) = 


ZP(X, >k)> min| f-’ (,> XP(X, > k). 


k =1 


k= 1 


Then, Corollary 3.5 and (3.5) give us 


(6 + 12 Cg) * E S(at) > minjEXt, £} > P(X* > k) = ES(a*), 

fe=i 

where we used that 

n 

Ei t = yp(i t >A). 


□ 


fe=i 


Proposition 3.8. For all t £ U, we have 

ES(ot) < (8 + 16C G ) • ES(a t ). 


Proof. Combining (3.5) and (3.6), we see that it is enough to prove the inequality 

i t 

- Y > k) < (8 + 16Cg) • 51 > k) (3.7) 

k=1 k =1 

for t G U D [0,4 

First, we assume that t < 2. Then, Corollary |3.4| implies 

t ■%. ^ ^ ( 8 i 

jY^ Xt >k)< t¥(X t > 1) < tmax j-, 8C G jP(A' t > 1). 


k =1 

Since t < 2, we further get 

P 

t 
1 

k=l 


Y p ( x e >k)< 16C G P(A t > 1) < 16C G ^P(A t > k), 


(3.8) 


fc =i 


which implies (3.7) for t < 2. 

Second, we assume 2 to < t < 2(m + 1) for some 1 < m < £/2. In that case, 
Corollary |3.5| yields 


f t t 

^ fc ) ^ - E p ( x * > k ) < - E( 2 + 4C G )P(A t > k). 

v TYl ?T1 

/c=l fc=l fc=l 

Using the inequality t < 2(to + 1), we conclude 

£ m i 

- p E p (^ > *0 < (8 + 16 Co) ■ E P ( A '* ^ k ) < (8 + 16C g) ■ E P ( A * ^ 


fc=l 


fc=l 


k =1 


which is (3.7) for £ > 2. Combining the latter with (3.8), the proof of the proposition 
is completed. □ 
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Theorem 3.9. Let a : S —> [0, oo) be an arbitrary measurable function on S. Then, 

6 + \ 2(J ' E5(o) < E S{a) < (8 + 16C G ) • ES(a). (3.9) 

Proof. Defining the ^"-measurable function Uk : G —)• [0, n] by 

Mfc(s) := inf{t : X t (g) > k} = inf{t : | h(t) n g\ > k}, 

we first show that d(f(u k (g))) is well defined for all d £ where f(t) := 
& _1 (6*(t)), and that this expression actually satisfies 

Sk{d)(g) = d(f{u k (g))), d£i 4- 

For the definition of see Observe that u k (g) is the unique number such 

that 

\h(u k {g) - e) riff| < k and \h(u k (g) + e) D g\ > k 
for all £ > 0 and, additionally, 

f(uk(g)) = P| h{u k {g) + e) \ h(u k (g) - e). 

£>0 

Hence, there exists an element (*o><?(*o)) £ f( u k{g)) satisfying 

b(io,g{io)) = S k (b)(g). 

Furthermore, observe that the definition of / implies 

b{y) = b*(u k (g)) for all y £ f(u k (g))- 


(3.10) 

(3.11) 


A consequence of the definition of .e/f, is that (3.10) and (3.11) imply 
d(io,g(io)) = S k (d)(g) = d(y) for all y £ f(u k (g)). 


Therefore, (3.11) together with the definition of s^b gives us that do f : u k {G) —► 
[0,oo) is a well defined and decreasing function, thus measurable. Observe that by 
changing variables we obtain 


ES fe (d)= / d(f(u k (g)))dP(g) = 
Jg 

We will now show that 


uk(G) 


d(f(z))dP Uk {z). 


Iu k (G) 


d(f(z))dP Uk (z) = 


lu k (G) 


d*(z)dP Uk (z). 


(3.12) 


Without loss of generality, we may assume that d is a simple function of the form 


d = ^2djl 

3 = 1 


Di 


with (dj) decreasing and { D 7 } a disjoint collection of measurable sets. Observe 
that in this case 


d* -^djl E i-i 




(3.13) 


3=1 


and (3.12) becomes 
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Thus, it is sufficient to prove 

c(G) n{z:{b = b*(z)} C Dj} = u k (G ) n [53 a(A), £ CT (A)) • (3.14) 


_ i-i 




On the one hand, let z £ u k {G) 0 ^(A), Xa=i c(A)) , hence, rf*(ar) = dj. 

Observe that since d £ A we have 

{y : &(y) = & *A>} c {y : d (y) = A-)} = A- 

On the other hand, let z £ u k (G) be such that {y : b{y) = b*(z)} C A'. Note that 
there exists a unique index jo such that 

{y : b{y) = b*(z)} C {y : d{y) = d*(z)} = D jo . 


Observe that (3.101 implies that u k {G) C {z : f(z ) ^ 0}, hence {y : b{y) = 
b*{z)} 7 ^ 0- Thus, the disjointness of the {A} implies j = jo- Therefore, we 

obtain from (3.13) that z £ o"(A)i Xa=i cr(A)) - This proves (3.14) and 

consequently (3.12). So far we proved that 


ES fc (d) = 

Therefore, setting P. 

ES(d) = 


d*(z)dP Ufc (,z), d £ A- 


Mfc(G) 

... we obtain 


'[0,ra] 


d*(z) dt'(^), d £ A- 


Recalling (3.4) we observe that 

E5(a) = - f a*(s) ds • i/[0, cr(/i(£))). 

Having now introduced the necessary tools for the proof of ( |3.9[ ), we first proceed 
by proving the upper estimate. Observe that with C := 8 + 16Cg, we can write 


CES(a) - ES(a) = 


'[0,7 


a*(x) cIt(x) 


(3.15) 


with the signed measure dr(cc) = Cdis(x) — Ho.^OU))) where y is the 

Lebesgue measure on [0, £). We have shown in Proposition 3.8 that 

minit *L\ 

Cu[0,t) = CES(at) > ES'(dt) =- ^ ’ - v[ 0, t £ U, 

i.e., r[0, t) > 0 for all t £ U. We will now show that r[0, t) > 0 for all t £ [0, n\. To 
this end, let t £ U c . Define uq = inf{u > t : u £ U} and note that uq £ U, since 
U is closed by Lemma 3.1 Observe that by (§ of Lemma |3.1| (t — s, uq) C U c for 
some e > 0. Hence, H of Lemma |3.1| implies that b* is constant on (t — £,uo), 
which means by definition of h that h(t) = h(uo). As a consequence we obtain 

Cu[0,t) = Cu[0,u o ) > mm{ J°’ n ^[0 Mhm, 


i.e. r[0,t) > 0 for all t £ [0,n]. Applying Lemma 2.4 to the right hand side of (3.15) 
we obtain 

CES(a) - ES(o) > 0, 

which concludes the proof of the upper estimate. 
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The proof of the lower estimate in (3.9) follows along the same lines by just 


employing Proposition |3.7| instead of Proposition |3.8| and using an appropriate 
signed measure different than r. 

□ 

3.3. Conclusion. As we have seen, we can reduce the case of general a to multi¬ 
ples of functions only taking values zero and one. We now use this fact to prove 
Theorem o 

Proof of Theorem o First we prove the lower estimate. Observe that Theo¬ 
rem 3.9 and the definition of a yield 


Ci ■ E > k-maxa(i,g(i)) > E > k-maxa(*,g(i)) 


k =1 


k =1 


l<i<n 


= -J a*(£) dt.(E^k-maxl, w ), 
where Ci = 8(1 + 2C G ). If i = 1, then Corollary 3.4 implies 

CiE max a(i,g(i)) > / a*(£)d£-E max 1 h(i){h dO)) 

l<i<n Jq l<i<n v 

> [ a*0)d£-P(Xi > 1) 

Jo 

2 8 hi 

For (. > 2 we use Corollary |3.6| and see 

* 1 rt V* 

Cl ■ E V'k-maxa(?:, 5 (j)) > - / a* (£) d£ • (E V' k-max thU) (i, gif)) 

L ' l<i<n £ In V z ' l<i<n 

k =1- * /u k=l - 


(3.16) 


> 


6(1 + 2C g ) J o 


a*(t) d£, 


which proves the lower estimate. 

Now, we proceed with the upper estimate. For all £ > 1 we have by Theorem |3. 9 


ci - E> k-maxa(i,s(*)) < E > k-maxa(i, g{i)) 

• ^ 1 r> ■ ^ 1 <'■>’<' r> 


k =1 


k=l 


l<i<r 


k =1 


= JJ a *(*) d£ • E ^ k-max {i,g(i)) 

< f a*(£)d£, 

Jo 


where ci = 1/(6 + 12C G ). This concludes the proof of the theorem. 


□ 
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4. An APPLICATION TO ORLICZ SPACES 

We will present an application of our main result (cf. Theorem [Ll]) dealing with 
averages of order statistics on random sequences. The expressions for the bounds 
on the expectations that we obtain for < 0 > are in terms of Orlicz norms and 
rather simple (cf. Theorem |1.4[ ). Note that by our “direct” approach, we recover 
Corollaries 2 and 3 from [7\. 

We will estimate the following expression: 


E > k-max I x^XA. 

^ Ki<n 
k= 1- 

where X-] ,..., X n are independent copies of a random variable X : (T2, /u) — > K. with 
E|A| < oo. Those expressions were already studied in [7]0. There, the argument 
in the proof is built upon an estimate involving only the largest order statistic and 
combinatorial results that were obtained in main. However, with Theorem EH 
problems of this form can be approached directly. 

Recall that M* in Theorem |1.4| is defined by 

0 < /3 < 1 . 

The following Lemma is a continuous version of Lemma 2.1 in m and our proof 
follows along the same lines. 


Lemma 4.1. Let 


B = conv 




= ±i,y^Qj 


=i 


Then we have 


B C B m * C 3 B. 


(4.1) 


Proof. First, we show the left inclusion. Let z £ B. Then 

n 

X*(y)dy) =Y, a '/ e=1 - 

2=1 

To show the other inclusion, let Z\ > • • • > z n > 0 and 

n 

E M *(^) = 1 - 

2=1 




M*{\z i \) = Y J M* 


We write z = z' + z" = (zi,... ,z r , 0 ..., 0) + (0,..., 0, z r + i, ..., z n ), where r is 
chosen such that M*(zi) > 1 /n for all 1 < * < r, and M*(zi) < 1/n for all 
r > i + 1. 

We have 



Therefore, z" < (\^ n X*(y) dy,..., J^ n X*(y) d y) =: w £ K n . Since w £ B, we 
also have z!' £ B. 

It is now left to show that z' £ B. There exist indices ki > 1 for 1 < * < r such 


that 


— < M*(zi) < 


ki 1 


n 


n 


(4.2) 
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•iki/n 


Since 

r j r 

i=1 i =1 

and YJi=i £ki/n < we immediately obtain 


X*(y)dy) <J2 M *(zi) < 1, 


a £k\/n ptKr/n 

X*(y)dy,...,j X*(y) dy, 0..., Oj G B. 


£k r /n 


Using (4.2), we see that 

/n 


/0 

2ik\jn 


ptKi/Tl n£K r /n 

2z ={ 2 J o X*(y)dy,...,2j X*(y) dy, 0...,Oj 

p2£k r /n 

X*(y)dy,...,J X*(y) dy, 0...,Oj > A 


Thus z' G 2B. We conclude that z G 3 B. 


□ 


Proof of Theorem \1.4\ We will assume that the independence of Xi, ..., X n is real¬ 
ized through n factors with G = £2” and X^g) = X(g(i)) for g G G and a canonical 
random variable X with the same distribution as A'i,..., X n . This first means that 


eA^ k-max = Eq k-rnax \xiX(g(i))\. 

1 <i<n l<i<n 

k =1 k=1 

Defining a : {1,..., n} x £2 — > K. by 

a(i, ui) := XiX(u), 

and setting P = 0" =1 /i, we obtain by Theorem 0 

t * z 

E k-max |xiA'i| = / k-max |o(i, g{i)) | dP(y) ~ / a*(t)dt. 
±<i<n JG,, t<i<n Jq 


k= 1 


Observe that we also have 


n£ Tt POti 

/ a*(t)dt= sup / X*(t)dt, 

J o J2 «i=< i—i do 

by approximation of A' with simple functions. Therefore, 

t n » ai 

EVk-maxl^WI- SU P T x i / X*(t)dt. 
^' l<i<n In 

k =1- 2^ a *-*i =1 


With B as in (4.1), using Lemma 4.1 and (2.1), we further obtain 


Ey^k-maxIcCiXil ~ sup y Xiyi ~ sup y ~ ||x||m- 

i<i<n yGext B y£B M , “ 


fc=1- 

This concludes the proof. 


□ 
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5. An APPLICATION TO THE LOCAL THEORY OF BANACH SPACES 


In this last section we present an example of a set of maps with a minimal 
number of elements satisfying conditions ([!]) and (JnJ) in Corollary 1.2 This is then 
used to embed certain Orlicz sequence spaces i^ into using the “standard” 
embedding, which usually provides an embedding into £™' 2 . 

Recall that given two normed spaces X , Y and some constant C > 1, we say that 

Q 

X C-embeds into Y and write Isf if there exists a one to one linear operator 
T : X —» 'F(X) C Y such that ||\E'|| • ||'I' _1 || < C. Given two isomorphic Banach 
spaces X and Y the Banach-Mazur distance of X and Y is defined as 

cLbm{X, Y) = inf {||T||||T -1 || : T £ L(X,Y) isomorphism} . 


Before we continue, let us give some historical remarks. The problem, given 
an n-dimensional subspace X of Li([0,1], dx) and e > 0, what is the smallest 
N = N(X,e) such that there is a subspace Y of with djjM {X, Y) < 1 + e, 
is extensively studied in the literature. A first breakthrough was made by G. 
Scheclitman in [28], proving that 

N < log(e _1 ) -n 2 , 

£ z 

C > 0 being an absolute constant. 

Based on his work, in [2] the bound in the dimension was improved to 

C 

N < — log(n£ _1 )(logn) 2 • n 

£ z 

Later, M. Talagrand in m proved that for any 0 < £ < £o 

N < CK{X) 2 £~ 2 ■ n, (5.1) 


where K(X) denotes the A-convexity constant of A (cf. [SU]). Recall that G. Pisier 
proved in {22] that if X C Li([0,1 ],dx) and diiri(A) = n, then K{X) < C^Jlog n. 
Thus (5.1) improves on previous results and gives 

N < Ce~ 2 log(n)n. 


(5.2) 


For further information we also refer to the work of W. B. Johnson and G. Schecht- 
man JT2], A. Naor and A. Zvavitch [21], J- Bernues and M. Lopez-Valdes, O. Fried- 
land and O. Guedon, as well as the references therein. 

Now, recall that in [3], using the theorem of de Finetti, J. Bretagnolle and D. 
Dacunha-Castelle proved that an Orlicz space £m is isomorphic to a subspace of L\ 
if and only if M is equivalent to a 2-concave Orlicz function. The corresponding 
finite-dimensional result was proved by S. Kwapien and C. Schiitt in (Till I5U] . In 
combination with the previous results, this shows that G-embeds into 

The proofs of the results mentioned above are very involved and quite technical. 
The embedding we present here is specific and rather simple, with an offset in the 
dimension N. 


5.1. A minimal set of maps. Let n be a power of a prime number and let F„ 
denote the field with n elements. We define Iq = Ho = F„ and denote by fiQ 
the probability measure on fl 0 defined by /ro({*}) = for all i £ F n . If we set 
Go = {gem : £, m £ F„}, where ge m (i ) = ii + m and multiplication and addition 
is performed in F n , then the probability measure P 0 on G 0 given by P 0 ({g}) = Aj-, 
for all g £ G 0 satisfies conditions (|i]) and ([n]) of Theorem 1 1.1 1 with Cq 0 = 1, that is 
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(i) P o(g(i) = j) = for all (i,j) £ I 0 x fl 0 , 

(ii) Po(5(ii) 2 ) = ji) < ^ 2 , for all / (* 2 ,^ 2 ) in I 0 x O 0 . 

We want to point out that the above G 0 consists of n 2 elements. 

We first prove condition (i) and let (i,j) £ Iq x fig. Then, for an arbitrary 
i £ F„, there exists exactly one m £ F„, which is given by m = j — £i such 
that ge m {i) = ii + m = j. Therefore, condition (i) is satisfied. For condition 
(ii) we note that for all different tuples (*i,ji), (i 2 ,j 2 ) £ lo x fio> in order to have 
= £ii+m = j 1 and gemfa) = £i 2 +m = j 2 for some £, m £ F„, it is necessary 
that i\ 7 ^ j 2 and in this case £ is given (uniquely) by £ = (j 1 — J 2 )(*i — * 2 ) 1 and 
m = ji — £ii = j 2 - & 2 - Therefore, the event {g £ G 0 : g(i 1 ) = ji,g(i 2 ) = j 2 } 
consists of at most one element and the definition of Po implies condition (ii). 

In general, we have the following result. Let n £ N, define ii = fii = {1,..., n} 
and set A*i({*}) = for all 1 < i < n. If G\ and Pi satisfy conditions (|ij) and @ of 
Theorem 1 1 . 1 1 for some constant C Gl > 1, then G\ consists of at least elements. 

Indeed, assume that conditions (i) and (ii) are satisfied with some constant 
C G 1 > 1 and assume that G\ consists of less than n 2 / C Gl elements. Then, there 
exists at least one element g\ £ G\ such that P({</i}) > C Gl /n 2 . Since, for the 
choice j 1 = 5 , i(«i),j 2 = 91 ( 12 ) and arbitrary different ii,i 2 £ F„, this g is an 
element of the event {g £ G 1 : g{i\) = ji,g{i 2 ) = J 2 }• Therefore, by (ii), we get 
the contradiction 


% < P({ ffl }) < P({<? £ G 1 : g(ii)=ji,g(i 2 )=j 2 }) < % 

n z n~ 


This shows that, up to a constant factor, the set of functions Go has the least 
number of elements satisfying conditions (|i| and ([II]) of Theorem |1.1| 


5.2. Embedding P}^ into . As an application to Banach space theory, we will 
now apply Theorem o to Iq, O 0 , /r 0 , Go and Po, defined as above, and prove 
Theorem |1.3[ We start by explaining the rough idea before going through the 
details. Let M be a strictly convex, twice differentiable Orlicz function that is 
strictly 2-concave. We will show that the Orlicz sequence space Pfo G-embeds into 
P^ 1 , where c and G are absolute constants independent of n and M. This should 
be compared with the “standard” embedding of P^ into £” !2 ". Recall that the 
standard embedding (cf. [50]) is given by 


* • f n _s. p n ' 2rl 
■ r M ^ t l 1 


^ n!2 n ( S e ’ i0 ’ r W a: * ) ’ 


^2 = 1 


where a = a(M) £ M" is chosen in such a way that it generates the Orlicz norm, 
i.e., 


a S D« 

7T £&n \i=l 


1/2 


"K (i) 


- x M- 
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Indeed, using Khintchine’s inequality we then obtain that 


ll^n(a:)||i 


1 

n!2" 


n 

7 r,£ i—1 


1 

n! 


E 

re©n 


Mm- 



1/2 


So the standard embedding combines Khintchine’s inequality with an average 
over the symmetric group ©„, which explains the dimension n!2 n (see also (291 
EH HU EH for embeddings of other types of spaces into L\). Instead of taking an 
average over the whole symmetric group, we rather use our minimal set of maps Go, 
which has cardinality n 2 only, thus obtaining an embedding into 2 .To further 
decrease the dimension, we will then make use of a result due to J. Bourgain, J. 
Lindenstrauss, and V.D. Milman that allows to use only cn sign vectors instead of 
2 n . 

Now let us be more precise. To find the sequence (ai)” =1 of scalars that generates 
the Orlicz norm, we use the following result due to C. Schiitt (cf. [50) Theorem 
2]): if M is a strictly convex, twice differentiable Orlicz function that is strictly 2- 
concave and satisfies M*( 1) = 1, then there exists a sequence a\,... ,a n of scalars 
such that for all x £ R", 

- c \\ x\\m < ^ (El^^wl 2 ) 7 < c ll x IUf, (5-3) 

TT&n i—1 

where c is a constant that does not depend on n and M. As a matter of fact, in 
pIUl Theorem 2] an explicit formula for the choice of a is given. 

J. Bourgain, J. Lindenstrauss, and V. D. Milman [T] proved the following: if 
v\,...,v n are unit vectors in a normed space (R”, || • ||), then, for all S > 0, there 
exists a constant C(6) >0 and N = C(5)n sign vectors e 1 ,...,^ £ {—1,1}" such 
that for all x £ R" 


N 


(1 — (5) Ave | 


i=l 


E ±x * v i ^ x E 


e^XiVi 


j =i i =i 


< (1 + 8) Ave 


E ±Xi 


Vi 


i= 1 


(5.4) 


For our purpose, it is enough to know that in the setting iq = i >2 = • • • = v n = ei, 
where e\ is the first standard unit vector of R”, || • || = || ■ ||i, and say 6 = 1/4, there 
exists a choice of N sign vectors that satisfy (5.4). 

The last ingredient is a special case of a result we recently obtained in m 
Theorem 1.4]) and reads as follows: let n £ N, a £ R nx ", and 1 < p < oo. Let G 
be a collection of maps from / = {1,..., n} to / and Cq > 0 be a constant only 
depending on G. Assume that for all i,j £ I and all different pairs (* 25 / 2 ) G 

lx I 


(i) P({ ff £ G : g{i) = j}) = 1/n, 

(ii) ¥({g £ G : g(i 1 ) =j 1 ,g(i 2 ) = j 2 }) < C G /n 2 . 
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Then 


C 


1 n n 

;E»W + (- E 


1/p 


k =1 


fc=n+l 


< E 


\ a ig(i) 

i=1 

i n n 2 i 

^E»®+G E *(*)')"> 


fc=l 


/c=n+l 


where (s(fc))^ =1 is the decreasing rearrangement of {| : i,j = l,...,n} and 
C > 0 is a constant only depending on Cq- 

Note that the conditions in the theorem are satisfied for G = Go, as was shown 
above, and for G = & n (cf. [TB] Example 1.2]), although with different, but still 
absolute constants. This means that, especially for p = 2, 

^2 E (X>hKdl 2 ) ' (EK(i)l 2 ) ' ( 5 - 5 ) 

g£Go i—1 7rG©n i—1 


for all a G R nxn . 

Let us now prove the embedding result. 


Proof of Theorem m Let Go be our minimal set of maps. We define the isomor¬ 
phism fn by 


* n : e 


M 


£ Cn 


X I ^ 


1 


Cn 3 


E £ i a s( 


i)%i 


V *— 1 / geGo,j=l,--,Cn 

Then a direct computation as shown in the standard embedding, now using equa¬ 


tions (5.3), ( |5.4| ) (in the setting mentioned above), and (5.5), shows that 

Il'MaOlli - \\ x \\m- 


This means that there exist absolute constants C, C\ >0 such that for all n G N, 
I'm Ci- embeds into t , where C, Ci > 0 are independent of M. □ 
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